Journal article
Disambiguation of biomedical text using diverse sources of information
M Stevenson, Y Guo, R Gaizauskas, D Martinez
BMC Bioinformatics | Published: 2008
Abstract
Like text in other domains, biomedical documents contain a range of terms with more than one possible meaning. These ambiguities form a significant obstacle to the automatic processing of biomedical texts. Previous approaches to resolving this problem have made use of various sources of information including linguistic features of the context in which the ambiguous term is used and domain-specific resources, such as UMLS. Materials and methods: We compare various sources of information including ones which have been previously used and a novel one: MeSH terms. Evaluation is carried out using a standard test set (the NLM-WSD corpus). Results: The best performance is obtained using a combinati..
Grants
Awarded by Engineering and Physical Sciences Research Council